Stack access control for memory device

ABSTRACT

Apparatuses and methods including an interface die that interfaces with dice through memory channels are described. An example apparatus includes a first die. The first die receives a first command including first command information and second command information provided after the first command information. The first die changes an order of providing the first command information and the second command information and provides a second command to a second die, the second command including the second command information and the first command information provided after the second command information in the changed order. The first command information is related to a command function and the second command information is related to a destination of the command function.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This application is a continuation of U.S. patent application Ser. No. 15/176,442, filed Jun. 8, 2016. The aforementioned application is incorporated by reference herein, in its entirety, and for any purposes.

BACKGROUND

High data reliability, high speed of memory access, lower power consumption and reduced chip size are features that are demanded from semiconductor memory. In recent years, three-dimensional (3D) memory devices have been introduced. Some 3D memory devices are formed by stacking dice vertically and interconnecting the dice using through-silicon (or through-substrate) vias (TSVs). Benefits of the 3D memory devices include shorter interconnects which reduce circuit delays and power consumption, a large number of vertical vias between layers which allow wide bandwidth buses between functional blocks in different layers, and a considerably smaller footprint. Thus, the 3D memory devices contribute to higher memory access speed, lower power consumption and chip size reduction. Example 3D memory devices include Hybrid Memory Cube (HMC) and High Bandwidth Memory (HBM).

For example, High Bandwidth Memory (HBM) is a type of memory including a high-performance dynamic random access memory (DRAM) interface and vertically stacked DRAM. FIG. 1 is a wiring diagram of a High Bandwidth Memory (HBM) 1 and a processor 2. For example, the processor 2 may be a graphical processor unit. The HBM 1 may include terminals coupled by balls 3 (e.g., microbumps) to an interposer 5. The processor 2 may include terminals coupled by balls 4 (e.g., microbumps) to the interposer 5 and further to the corresponding terminals of the HBM 1 through the interposer 5. The interposer 5 may be stacked on a packaging substrate (not shown) by balls 6. For example, the interposer 5 may be made of silicon.

FIG. 2A is a schematic diagram of an HBM stack 21 including an interface (I/F) die 22 and a plurality of core dies 23. For example, the number of the plurality of core dies 23 in the HBM stack 21 may be four. FIG. 2B is a schematic diagram of a portion of the HBM stack 21. The I/F die 22 and the plurality of core dies 23 may be coupled by a plurality of conductive vias 27 (e.g., through-silicon (or through-substrate) vias (TSVs)). The I/F die 22 may be on the balls 3. For example, a combination of the conductive vias 27 and the balls 3 may function as interconnects. FIG. 2C is a schematic diagram of the HBM stack 21 including the I/F die 22 and the plurality of core dies 23. The HBM stack 21 may have two 128-bit channels per core die for a total of eight input/output channels and a width of 1024 bits in total. For example, each core die of the plurality of the core dies 23 may include two channels. In this example, the core dies 23 a, 23 b, 23 c and 23 d include channels A and C, channels B and D, channels E and G, and channels F and H, respectively. For example, a clock frequency, a command sequence, and data can be independently provided for each channel.
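The channel arithmetic in the example above (four core dies, two 128-bit channels per die) can be checked with a short sketch. The names CHANNEL_MAP, total_channels and total_width_bits are hypothetical and introduced only for illustration; the values are those stated for FIG. 2C.

```python
# Illustrative sketch only: names are hypothetical, values come from the text.
CHANNEL_WIDTH_BITS = 128  # width of one channel in the example

# Channel assignment of core dies 23 a to 23 d as described for FIG. 2C.
CHANNEL_MAP = {
    "23a": ("A", "C"),
    "23b": ("B", "D"),
    "23c": ("E", "G"),
    "23d": ("F", "H"),
}


def total_channels(channel_map):
    """Count the input/output channels across all core dies."""
    return sum(len(channels) for channels in channel_map.values())


def total_width_bits(channel_map, width=CHANNEL_WIDTH_BITS):
    """Total data width of the stack: number of channels times channel width."""
    return total_channels(channel_map) * width
```

With the mapping above, the sketch reproduces the eight channels and the 1024-bit total width given in the description.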

FIG. 4A is a wiring diagram of the HBM stack 21 including the I/F die 22 and the plurality of core dies 23. The I/F die 22 of the HBM stack 21 provides interfaces 28 a, 28 b, 28 e and 28 f which provide signals on four input/output channels among the eight input/output channels, which function independently of each other. Memory arrays of the channel A, channel B, channel E and channel F of the core dies 23 a, 23 b, 23 c and 23 d may be coupled to the I/F die 22 via native input/output lines (IOs) 27 a, 27 b, 27 e and 27 f, respectively. For example, the native IOs 27 a to 27 f may be implemented as conductive vias. For example, the conductive vias may have a spiral structure. Each core die 23 may include a command circuit for each channel. For example, the core dies 23 a to 23 d may include command circuits 26 a to 26 d for channel A, channel B, channel E and channel F, respectively. Thus, clock signals, command signals and data signals for each channel may be transmitted independently and a plurality of data buses and their respective channels can operate individually.

FIG. 3A is a schematic diagram of an HBM stack 31 including an interface (I/F) die 32 and a plurality of core dies 33. For example, the number of the plurality of core dies 33 in the HBM stack 31 may be eight. FIG. 3B is a schematic diagram of the HBM stack 31 including the I/F die 32 and the plurality of core dies 33. The HBM stack 31 may have two 128-bit channels per core die for a total of eight input/output channels and a width of 1024 bits in total. For example, each core die of the plurality of the core dies 33 may include two channels. In this example, a stack group 34 a having a stack identifier (SID) “0” includes the core dies 33 a, 33 b, 33 c and 33 d including channels A and C, channels B and D, channels E and G, and channels F and H, respectively. A stack group 34 b having a stack ID (SID) “1” includes the core dies 33 e, 33 f, 33 g and 33 h including channels A and C, channels B and D, channels E and G, and channels F and H, respectively. Thus, a destination die among a plurality of core dies in each channel (e.g., core dies 33 a and 33 e of channel A) addressed in a command may be identified by the SID.

FIG. 4B is a wiring diagram of the HBM stack 31 including the I/F die 32 and the plurality of core dies 33. The I/F die 32 of the HBM stack 31 provides interfaces 38 a, 38 b, 38 e and 38 f which provide signals on four input/output channels among the eight input/output channels of two stack groups. Memory arrays of channels A, B, E and F of the stack group 34 a and memory arrays of channels A, B, E and F of the stack group 34 b may be coupled to the same native input/output lines (IOs) 37 a, 37 b, 37 e and 37 f, respectively. For example, memory arrays of channel A of the core die 33 a in the stack group 34 a and memory arrays of channel A of the core die 33 e in the stack group 34 b may be coupled to the native IO 37 a. Each core die 33 may include a command circuit for each channel. For example, the core dies 33 a to 33 d in the stack group 34 a may include command circuits 36 a to 36 d for channel A, channel B, channel E and channel F, respectively. The core dies 33 e to 33 h in the stack group 34 b may include command circuits 36 e to 36 h for channel A, channel B, channel E and channel F, respectively. Each command circuit 36 may detect the SID in a command, check whether the SID in the command matches an SID of the stack group of the core die 33 including the command circuit 36, and decode the command if the SIDs match, so that memory access actions responsive to the command may be performed. For example, when the interface 38 a transmits a command on the input/output line 37 a, the command circuit 36 a receives the command and checks whether the SID in the command is “0”. The command circuit 36 a processes the command if the SID is “0” and ignores the command if the SID is “1”. The command circuit 36 e also receives the command and checks whether the SID in the command is “1”. The command circuit 36 e processes the command if the SID is “1” and ignores the command if the SID is “0”. Thus, clock signals, command signals and data signals for each channel on each die may be transmitted independently.
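The SID check performed by each command circuit on a shared native IO can be sketched as follows. This is a behavioral illustration under stated assumptions, not the circuit of FIG. 4B: the Command and CommandCircuit classes and the broadcast function are hypothetical names, and a command is reduced to a function, a channel and an SID.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Command:
    """Hypothetical, simplified command: function, target channel and SID."""
    function: str  # e.g. "RD" or "WR"
    channel: str   # e.g. "A"
    sid: int       # stack ID carried in the command


@dataclass
class CommandCircuit:
    """Hypothetical model of a per-channel command circuit on a core die."""
    name: str
    channel: str
    stack_sid: int  # SID of the stack group this core die belongs to

    def accepts(self, cmd: Command) -> bool:
        # Process the command only if channel and SID both match.
        return cmd.channel == self.channel and cmd.sid == self.stack_sid


def broadcast(cmd, circuits):
    """Every circuit sharing the native IO sees cmd; return names of those that process it."""
    return [c.name for c in circuits if c.accepts(cmd)]
```

For example, with circuits modeled after 36 a (channel A, SID 0) and 36 e (channel A, SID 1), a read command carrying SID 0 is processed only by the first and ignored by the second, even though both receive it.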

FIG. 5 is a command truth table of various combinations of a clock cycle, a clock enable signal, and row command/address signals to be provided to the HBM 1. For example, a command circuit for each channel on each die may receive a plurality of row command/address signals R[5:0], a clock enable (CKE) signal and the clock signals. In the command truth table, “H” represents a logic high signal, “L” represents a logic low signal, RA[15:0] represents a row address, BA[3:0] represents a bank address, “PAR” represents parity information, and “V” represents a corresponding bit that can be either “H” or “L” which is a defined logic high or low level. Functions of row commands may include Row No Operation (RNOP), Activate (ACT), Precharge (PRE), Precharge All (PREA), Single Bank Refresh (REFSB), Refresh (REF), Power Down Entry (PDE), Self Refresh Entry (SRE) and Power Down & Self Refresh Exit (PDX/SRX). The SID may be provided at a rising edge of the ACT command, or at a falling edge of R[1] of the PRE command or the REFSB command.

FIG. 6 is a command truth table of various combinations of a clock cycle, a clock enable signal, and column command/address signals to be provided to the HBM 1. Description of components corresponding to components included in and previously described with reference to FIG. 5 will not be repeated. For example, a command circuit for each channel on each die may receive a plurality of column command/address signals C[7:0], the CKE signal and the clock signals. In the command truth table, CA[6:0] represents a column address and OP[6:0] represents operands to be written. Functions of column commands may include Column No Operation (CNOP), Read (RD), Read w/ AP (RDA), Write (WR), Write w/ AP (WRA), and Mode Register Set (MRS). As shown in FIG. 6, the SID may be provided at a falling edge of R[1] of the RD command, the RDA command, the WR command, or the WRA command. The RDA command or the WRA command, each with auto-precharge, may be used when an auto-precharge is to occur on a bank associated with the command. As shown in FIGS. 5 and 6, the CKE signal is active (e.g., “H”) while a command is being provided. As mentioned earlier, each command circuit may detect the SID in a command, and check whether the SID in the command matches an SID of the stack group of the core die of the command circuit. The SID may be provided at the falling edge of the clock cycle of the commands (e.g., PRE, REFSB, RD, RDA, WR and WRA). For example, as shown in FIG. 4B, when the interface 38 a transmits a command on the input/output line 37 a, the command circuit 36 a receives the command and checks whether the SID in the command is “0” or “1”.

FIG. 7 is a timing diagram of clock signals and command signals to be provided to a portion of dies in the HBM stack 31. For example, the portion of dies may be the I/F die 32, the Core 1 die 33 a, and the Core 5 die 33 e in FIG. 4B. For example, the timing diagram of FIG. 7 includes a clock signal CK_t and column command signals C[7:0] received at the I/F die 32, a clock signal CK_t_0 and column command signals C_0[7:0] received at the Core 1 die 33 a which processes a command for channel A in a stack group with SID=“0”, and a clock signal CK_t_1 and column command signals C_1[7:0] received at the Core 5 die 33 e which processes a command for channel A in a stack group with SID=“1”. The I/F die 32 receives a command from a first clock cycle of the clock signal CK_t at time T0. The I/F die 32 may capture an SID included in the command at a falling edge of the first clock cycle of the CK_t signal at time T1. The Core 1 die 33 a may capture the SID at a falling edge of a first clock cycle of the CK_t_0 signal at time T2. The Core 5 die 33 e receives the SID at a falling edge of a first clock cycle of the CK_t_1 signal at time T3. There may be a propagation delay from the I/F die 32 to the Core 1 die 33 a represented by “T2-T1.” There may be a propagation delay from the Core 1 die 33 a to the Core 5 die 33 e represented by “T3-T2.” The command circuits 36 a and 36 e of the Core 1 die 33 a and the Core 5 die 33 e wait for the SID until the falling edge of the first clock cycle and determine whether the SID corresponds to the core die of the command circuit. When a command is issued to the Core 1 die 33 a, the command related signals may also be transmitted to the Core 5 die 33 e, because the Core 1 die 33 a may capture the SID only at time T2, after transmission of the first clock cycle of the command toward the Core 5 die 33 e may have started. The command circuit 36 a of the Core 1 die 33 a may not be able to determine whether the command is for the Core 1 die 33 a or for the Core 5 die 33 e until capturing the SID. The command circuit 36 e of the Core 5 die 33 e may not be able to determine whether the command is for the Core 5 die 33 e until capturing the SID. If the propagation delay is about half a clock cycle, the SID may be captured by the command circuit 36 e after a propagation delay of about a clock cycle. Thus, command signals unnecessary for the Core 5 die 33 e may be transmitted until the SID is captured at time T3.
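The relationship between the capture times T1, T2 and T3 can be illustrated numerically. The sketch below assumes a 50% duty cycle (so the first falling edge occurs half a period after T0) and takes the two propagation delays as free parameters; the function name sid_capture_times and all numeric values are hypothetical and are not taken from FIG. 7.

```python
def sid_capture_times(t0, period, delay_if_to_core1, delay_core1_to_core5):
    """Return (T1, T2, T3): times at which the SID is captured at each die.

    Assumption: 50% duty cycle, so the falling edge of the first clock
    cycle at the I/F die occurs half a period after the rising edge at t0.
    """
    t1 = t0 + period / 2            # falling edge of CK_t at the I/F die
    t2 = t1 + delay_if_to_core1     # falling edge of CK_t_0 at the Core 1 die
    t3 = t2 + delay_core1_to_core5  # falling edge of CK_t_1 at the Core 5 die
    return t1, t2, t3
```

For instance, with a normalized period of 1.0 and a hypothetical delay of 0.5 per hop, the Core 5 die captures the SID a full clock cycle after the I/F die does, which illustrates how long command signals may toggle on the farther die before it can tell whether the command is addressed to it.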

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a wiring diagram of a High Bandwidth Memory (HBM) and a processor.

FIG. 2A is a schematic diagram of an HBM stack including an interface (I/F) die and a plurality of core dies.

FIG. 2B is a schematic diagram of a portion of the HBM stack.

FIG. 2C is a schematic diagram of the HBM stack including the I/F die and the plurality of core dies.

FIG. 3A is a schematic diagram of an HBM stack including an interface (I/F) die and a plurality of core dies.

FIG. 3B is a schematic diagram of the HBM stack including the I/F die and the plurality of core dies.

FIG. 4A is a wiring diagram of the HBM stack including an I/F die and a plurality of core dies.

FIG. 4B is a wiring diagram of the HBM stack including an I/F die and a plurality of core dies.

FIG. 5 is a command truth table of various combinations of a clock cycle, a clock enable signal, and row command/address signals to be provided to the HBM.

FIG. 6 is a command truth table of various combinations of a clock cycle, a clock enable signal, and column command/address signals to be provided to the HBM.

FIG. 7 is a timing diagram of clock signals and command signals to be provided to a portion of dies in the HBM stack.

FIG. 8 is a block diagram of the HBM in a semiconductor device in accordance with an embodiment of the present disclosure.

FIG. 9 is a block diagram of a command control circuit on an I/F die of an HBM in a semiconductor device in accordance with an embodiment of the present disclosure.

FIG. 10 is a timing diagram of clock related signals and column command signals in the command control circuit in FIG. 9, in accordance with an embodiment of the present disclosure.

FIG. 11 is a block diagram of a portion of a command control circuit on an I/F die of an HBM in a semiconductor device in accordance with an embodiment of the present disclosure.

FIGS. 12A and 12B are timing diagrams of clock related signals and column command signals in the HBM in FIG. 8, in accordance with an embodiment of the present disclosure.

FIG. 13 is a block diagram of a portion of a core die in the HBM in a semiconductor device in accordance with an embodiment of the present disclosure.

FIG. 14 is a block diagram of an output buffer control circuit on the core die in FIG. 13 in accordance with an embodiment of the present disclosure.

FIG. 15 is a timing diagram of clock signals, command signals and data signals to be provided to a portion of dies in a write operation, in accordance with an embodiment of the present disclosure.

FIG. 16 is a block diagram of the HBM in a semiconductor device in accordance with an embodiment of the present disclosure.

FIG. 17 is a block diagram of a write SID counter in a core die, in accordance with an embodiment of the present disclosure.

FIG. 18 is a timing diagram of clock related signals and column command signals in the HBM in FIG. 8, in accordance with an embodiment of the present disclosure.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Various embodiments of the present invention will be explained below in detail with reference to the accompanying drawings. The following detailed description refers to the accompanying drawings that show, by way of illustration, specific aspects and embodiments in which the present invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the present invention. Other embodiments may be utilized, and structural, logical and electrical changes may be made without departing from the scope of the present invention. The various embodiments disclosed herein are not necessarily mutually exclusive, as some disclosed embodiments can be combined with one or more other disclosed embodiments to form new embodiments.

FIG. 8 is a block diagram of an HBM in a semiconductor device in accordance with an embodiment of the present disclosure. For example, the HBM 80 may include an interface (I/F) die 82 and a plurality of core dies 83 a to 83 h. In FIG. 8, a stack group 84 a includes four core dies Core1 83 a to Core4 83 d which have an SID “0.” A stack group 84 b includes four core dies Core5 83 e to Core8 83 h which have an SID “1.” The I/F die 82 may include a plurality of input buffers Rx 821. The Rx 821 may receive a clock signal CK_t, a plurality of row command/address signals R[5:0] and a plurality of column command/address signals C[7:0]. The received signals R[5:0] and C[7:0] may be transmitted to a sampler circuit 822 a. The sampler circuit 822 a may capture the R[5:0] and C[7:0] signals by the CK_t signal and may further provide an intermediate IF command signal to an IF command circuit 823. The IF command circuit 823 may decode the intermediate IF command signal and may further provide one or more interface input/output (IFIO) control signals. The IFIO control signals may be provided to a gating circuit 824, a flip-flop (FF) circuit 826 and an IO control circuit 827. Responsive to the IFIO control signals, the gating circuit 824 may provide the clock signal from the input buffer Rx 821 a, and the row command/address signals and the column command/address signals from the sampler circuit 822 a, to a plurality of output buffers Tx 825. The plurality of output buffers Tx 825 a, 825 b and 825 c may provide, via a conductive path 87 a, a clock signal CK_t_0, a plurality of row command/address signals R_0[5:0] and a plurality of column command/address signals C_0[7:0], which may have a delay relative to the CK_t, R[5:0] and C[7:0] signals, respectively. For example, the delay corresponds to a delay caused by the IF command circuit 823. For example, the conductive path 87 a may be through-silicon vias (TSVs) (e.g., a portion of the input/output line 37 a).

The I/F die 82 may also receive a write data strobe signal WDQS and data signals DQ[127:0]. The received signals WDQS and DQ[127:0] may be transmitted to a sampler circuit 822 b. The sampler circuit 822 b may capture the DQ[127:0] signals on both rising and falling edges of the WDQS for a write operation. The IO control circuit 827 controls the write operation and a read operation based on the IFIO control signals from the IF command circuit 823. Responsive to IFIO control signals related to the read operation or the write operation, the IO control circuit 827 receives a read and write clock signal rwclk_0 via an IO driver 828. If the IF command is indicative of a command related to the write operation, the DQ[127:0] captured by the sampler circuit 822 b may be transmitted to the IO control circuit 827 via the FF 826, and the IO control circuit 827 may further provide data signals DQ_0[127:0], which may have the delay relative to the DQ[127:0], via the IO driver 828 to the Core1 die 83 a together with the read and write clock signal rwclk_0, responsive to the IFIO control signals. If the IF command is indicative of a command related to the read operation, the IO control circuit 827 receives the read and write clock signal rwclk_0 via the IO driver 828 from the core die Core1 83 a. Responsive to IFIO control signals related to the read operation, the DQ_0[127:0] responsive to the read and write clock signal rwclk_0 may be transmitted to the IO control circuit 827 via the IO driver 828 from the core die Core1 83 a, and the IO control circuit 827 may further provide the data signals DQ[127:0] as read out data, responsive to the IFIO control signals.

The core die Core1 83 a may include a plurality of input buffers Rx 831 a. The Rx 831 a may receive the clock signal CK_t_0, a plurality of row command/address signals R_0[5:0] and a plurality of column command/address signals C_0[7:0]. The received signals CK_t_0, R_0[5:0] and C_0[7:0] may be provided to a plurality of output buffers Tx 835 a, respectively, and the plurality of output buffers Tx 835 a may provide CK_t_1, R_1[5:0] and C_1[7:0] to the core die Core5 83 e by driving a conductive path 87 b. For example, the conductive path 87 b may be through-silicon vias (TSVs) (e.g., a portion of the input/output line 37 a). The received signals CK_t_0, R_0[5:0] and C_0[7:0] may be provided to a sampler 832 a. The sampler 832 a may capture the R_0[5:0] and C_0[7:0] signals by the CK_t_0 signal and may further provide an intermediate core command signal to a command circuit 833 a. The command circuit 833 a may decode the intermediate core command signal and may obtain a core command. The command circuit 833 a may be provided with a stack ID (SID) indicative of the stack group 84 a (e.g., the SID “0”). The command circuit 833 a compares the SID in the core command with the SID indicative of the stack group 84 a. If the two SIDs match, the command circuit 833 a may execute a write operation or a read operation responsive to the core command. For example, the command circuit 833 a may provide one or more core input/output (CIO) control signals responsive to the core command to a data control circuit 839 a and at least one memory array 830 a on the core die Core1 83 a. Responsive to the CIO control signals, the data control circuit 839 a may provide the read and write clock signal rwclk_0 from an IO driver 838 a to the memory array 830 a. The data control circuit 839 a controls the write operation and the read operation based on the CIO control signals from the command circuit 833 a. If the core command is indicative of a command related to the write operation, the data control circuit 839 a may provide the DQ_0[127:0] received at the IO driver 838 a from the I/F die 82 to the memory array 830 a based on the read and write clock signal rwclk_0. If the core command is indicative of a command related to the read operation, the data control circuit 839 a may read data from the memory array 830 a and provide the read data as the DQ_0[127:0] via the IO driver 838 a responsive to the read and write clock signal rwclk_0, and the IO driver 838 a may further provide the data signals DQ_0[127:0] as read out data together with the read and write clock signal rwclk_0 to the I/F die 82. In some embodiments, the IO driver 838 a may further provide data signals DQ_1[127:0] and a read and write clock signal rwclk_1 based on the received data signals DQ_0[127:0] and the read and write clock signal rwclk_0, responsive to the SID in the core command being different from the SID of the stack group 84 a in the write operation. In some embodiments, the IO driver 838 a may further provide data signals DQ_1[127:0] and a read and write clock signal rwclk_1 based on the received data signals DQ_0[127:0] and the read and write clock signal rwclk_0 in the write operation, regardless of the SID in the core command.

The core die Core5 83 e may include a plurality of input buffers Rx 831 b. The Rx 831 b may receive the clock signal CK_t_1, a plurality of row command/address signals R_1[5:0] and a plurality of column command/address signals C_1[7:0]. The received signals CK_t_1, R_1[5:0] and C_1[7:0] may be provided to a sampler 832 b. The sampler 832 b may capture the R_1[5:0] and C_1[7:0] signals by the CK_t_1 signal and may further provide an intermediate core command signal to a command circuit 833 b. The command circuit 833 b may decode the intermediate core command signal and may obtain a core command. The command circuit 833 b may be provided with a stack ID (SID) indicative of the stack group 84 b (e.g., the SID “1”). The command circuit 833 b compares the SID in the core command with the SID indicative of the stack group 84 b. If the two SIDs match, the command circuit 833 b may execute a write operation or a read operation responsive to the core command. For example, the command circuit 833 b may provide one or more core input/output (CIO) control signals responsive to the core command to a data control circuit 839 b and at least one memory array 830 b on the core die Core5 83 e. Responsive to the CIO control signals, the data control circuit 839 b may provide the read and write clock signal rwclk_1 from an IO driver 838 b to the memory array 830 b. The data control circuit 839 b controls the write operation and the read operation based on the CIO control signals from the command circuit 833 b. If the core command is indicative of a command related to the write operation, the data control circuit 839 b may provide the DQ_1[127:0] received at the IO driver 838 b from the core die Core1 83 a to the memory array 830 b based on the read and write clock signal rwclk_1. If the core command is indicative of a command related to the read operation, the data control circuit 839 b may read data from the memory array 830 b and provide the read data responsive to the read and write clock signal rwclk_1, and the IO driver 838 b may further provide the data signals DQ_1[127:0] as read out data together with the read and write clock signal rwclk_1 to the core die Core1 83 a.

FIG. 9 is a block diagram of a command control circuit 90 on an I/F die of an HBM in a semiconductor device in accordance with an embodiment of the present disclosure. For example, the command control circuit 90 may be implemented on the I/F die 82 shown in FIG. 8 in some embodiments. The command control circuit 90 may receive a column command/address signal C[0] and a clock signal CK_t. For example, the C[0] signal may be used to implement the C[0] signal of the plurality of column command/address signals C[7:0] in FIG. 8 and the CK_t may be used to implement the CK_t signal in FIG. 8. The C[0] signal may include an SID bit at a first falling edge as shown in FIG. 6. The command control circuit 90 may include a sampler 92, a command circuit 93 and a gating sub circuit 94. The sampler 92, the command circuit 93 and the gating sub circuit 94 may be included in the sampler circuit 822 a, the IF command circuit 823 and the gating circuit 824 in FIG. 8, respectively. The sampler 92 may include flip-flop circuits (FF) 921 and 922. The FF 921 receives the C[0] signal at a data input and the CK_t signal at a clock input. The FF 921 captures the C[0] signal in response to a rising edge of the CK_t signal and provides the C0(Rise) signal according to the C[0] signal until a next rising edge of the CK_t signal. The FF 922 receives the C[0] signal at a data input and an inverted signal of the CK_t signal at a clock input. The FF 922 captures the C[0] signal in response to a falling edge of the CK_t signal and provides the C0(Fall) signal according to the C[0] signal until a next falling edge of the CK_t signal.
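The behavior of the two flip-flops in the sampler 92 can be sketched as a discrete-time simulation. This is an illustrative model only: the function sample_ddr and the list-based waveform representation are assumptions made for the example, setup and hold timing is ignored, and the outputs are None until the first corresponding edge.

```python
def sample_ddr(ck, data):
    """Model FF 921 (rising-edge capture) and FF 922 (falling-edge capture).

    ck   : list of 0/1 clock samples (models CK_t)
    data : list of data samples of equal length (models C[0])
    Returns (rise_out, fall_out), modeling C0(Rise) and C0(Fall) per sample.
    """
    rise = fall = None          # flip-flop contents before any edge
    prev = ck[0]                # previous clock level, used for edge detection
    rise_out, fall_out = [], []
    for c, d in zip(ck, data):
        if prev == 0 and c == 1:    # rising edge: FF 921 captures
            rise = d
        elif prev == 1 and c == 0:  # falling edge: FF 922 captures
            fall = d
        rise_out.append(rise)       # each output holds until its next edge
        fall_out.append(fall)
        prev = c
    return rise_out, fall_out
```

Each output holds its captured value until the next edge of the same polarity, so a bit presented at the falling edge of the first clock cycle (such as the SID bit on C[0]) appears on the falling-edge output while the rising-edge output still holds the bit captured half a cycle earlier.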

The command circuit 93 may include a command decoder 931 and a delay circuit 932. The command decoder 931 may receive the C0(Rise) signal, the C0(Fall) signal and the CK_t signal and may further decode the C0(Rise) and C0(Fall) signals (and other column command/address signals, which are not shown in FIG. 9) responsive to the CK_t signal to obtain a column command. The command decoder 931 provides a clock enable signal cken responsive to the column command. For example, an active cken signal (e.g., having a logic high level) may be provided when the column command is related to functions other than CNOP. When the column command is indicative of the function CNOP, the cken signal may not be active (e.g., having a logic low level). The delay circuit 932 delays an input signal by a decoding delay corresponding to the time taken by the command decoder 931 for decoding. Thus, the delay circuit 932 provides a delayed clock signal ckdel having the decoding delay relative to the CK_t signal.

The gating sub circuit 94 may include a clock gate circuit CKG 941, flip-flop circuits (FF) 942 and 943, a composition circuit 944 and a delay circuit 945. The clock gate circuit CKG 941 receives the cken signal at a data input and the ckdel signal at a clock input. The CKG 941 provides the cken signal as an intermediate result signal (not shown) while the ckdel signal is not active (e.g., a logic low level), and a level of the intermediate result signal is maintained while the ckdel signal is active (e.g., a logic high level). The CKG 941 provides an intermediate clock signal ckint which is a result of an AND operation of the intermediate result signal and the ckdel signal. The FF 942 receives the C0(Rise) signal at a data input and the ckint signal at a clock input. The FF 942 captures the C0(Rise) signal in response to a rising edge of the ckint signal and provides the C_0[0](Rise) signal according to the C0(Rise) signal to the composition circuit 944 until a next rising edge of the ckint signal. The FF 943 receives the C0(Fall) signal at a data input and an inverted signal of the ckint signal at a clock input. The FF 943 captures the C0(Fall) signal in response to a falling edge of the ckint signal and provides the C_0[0](Fall) signal according to the C0(Fall) signal to the composition circuit 944 until a next falling edge of the ckint signal. The composition circuit 944 receives the C_0[0](Rise) signal and the C_0[0](Fall) signal and provides a composite signal C_0[0]. The delay circuit 945 delays an input signal by a composition delay of the composition circuit 944. Thus, the delay circuit 945 provides a core clock signal CK_t_0 having the composition delay relative to the ckint signal. Thus, the clock gate circuit CKG 941 may provide the core clock signal CK_t_0 to core dies when the column command is related to functions other than CNOP, whereas the clock gate circuit CKG 941 may terminate providing the core clock signal CK_t_0 to core dies when the column command is indicative of the CNOP function.
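The latch-and-AND behavior described for the clock gate circuit CKG 941 can be sketched in a few lines. The function clock_gate and the sampled-waveform representation are assumptions made for illustration; the model treats the intermediate result signal as a level-sensitive latch that is transparent while ckdel is low and holds while ckdel is high.

```python
def clock_gate(cken, ckdel):
    """Model of a latch-based clock gate.

    cken  : list of 0/1 samples of the clock enable signal
    ckdel : list of 0/1 samples of the delayed clock signal
    Returns the list of 0/1 samples of the gated clock (models ckint).
    """
    latched = 0                 # intermediate result signal (latch content)
    ckint = []
    for en, ck in zip(cken, ckdel):
        if ck == 0:
            latched = en        # latch is transparent while the clock is low
        # while the clock is high, the latched level is held
        ckint.append(latched & ck)  # AND of latched enable and clock
    return ckint
```

Because the enable is only updated while the clock is low, a change of cken during the high phase neither truncates nor creates a clock pulse, which is the usual reason for placing a latch in front of the AND gate.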

The command control circuit 90 may also include combinations of a sampler and a gating sub circuit for each of the C[7:1] signals (not shown). Each combination of the sampler and the gating sub circuit may be similar to the sampler 92 and the gating sub circuit 94; however, the gating sub circuits for the C[7:1] signals may not include a clock gate circuit equivalent to the CKG 941 and may instead obtain the ckint signal from the clock gate circuit CKG 941.

FIG. 10 is a timing diagram of clock related signals and column command signals in the command control circuit 90 in FIG. 9, in accordance with an embodiment of the present disclosure. As mentioned earlier, the C[0] signal may include an SID bit at the first falling edge as shown in FIG. 6. The sampler 92 may capture the C[0] signal “R” at a rising edge of a first clock cycle of the clock signal CK_t at time T0 and may provide the C0(Rise) signal indicative of the “R” from the FF 921 from the time T0, until a next bit is captured. The sampler 92 may capture the SID included in the C[0] signal at a falling edge of the first clock cycle of the CK_t signal at time T1′ and may provide the C0(Fall) signal indicative of the SID from the FF 922 from the time T1′, until a next bit is captured. For example, the cken signal may be activated at around T1′, due to the decoding delay. According to the decoding delay, the ckdel signal may be provided with the decoding delay relative to the CK_t signal from time T2′. In the gating sub circuit 94, the C0(Rise) signal and the C0(Fall) signal are composited and provided as the C_0[0] signal. According to the composition delay, the CK_t_0 signal may be provided with the composition delay relative to the ckdel signal from time T3′. In this example, the CK_t_0 signal has a delay “T3′-T0” (e.g., about one clock cycle) from the CK_t signal, which is significantly longer than the propagation delay from the I/F die 32 to the Core 1 die 33 a represented by “T2-T1” in FIG. 7 (e.g., about a quarter clock cycle).

FIG. 11 is a block diagram of a portion of a command control circuit 110 on an I/F die of an HBM in a semiconductor device in accordance with an embodiment of the present disclosure. For example, the command control circuit 110 may be implemented on the I/F die 82 shown in FIG. 8 in some embodiments. The command control circuit 110 may receive column command/address signals C[7:0] and a clock signal CK_t. The command control circuit 110 may decode the column command/address signals C[7:0] and may further provide column command/address signals C_0[7:0] related to the column command/address signals C[7:0] and the CK_t_0 signal related to the clock signal CK_t. Here, the command control circuit 110 may provide the column command/address signals C_0[7:0] including the SID in the command in a clock cycle earlier than the originally transmitted clock cycle (e.g., in C[0] at a rising edge, which is earlier than in C[0] at a falling edge in FIG. 6). The command control circuit 110 may provide at least a portion of the column command/address signals including command information in an order different from the order in which the at least a portion of the column command/address signals is received. For example, the command control circuit 110 on the I/F die 82 may receive first command information and may further receive second command information including the SID after receiving the first command information. The command control circuit 110 on the I/F die 82 may provide the second command information and may further provide the first command information after providing the second command information.

For example, FIG. 11 shows circuitry for processing C[0] and C[2] signals which may be used to implement the C[0] and C[2] signals of the plurality of column command/address signals C[7:0] in FIG. 8, and the CK_t may be used to implement the CK_t signal in FIG. 8. The command control circuit 110 may include samplers 112 a and 112 b, a command decoder circuit 113 and gating sub circuits 114 a and 114 b. The samplers 112 a and 112 b may be included in the sampler circuit 822 a. The command decoder circuit 113 may be the IF command circuit 823. The gating sub circuits 114 a and 114 b may be included in the gating circuit 824 in FIG. 8. FIGS. 12A and 12B are timing diagrams of clock related signals and column command signals in the command control circuit in FIG. 11, in accordance with an embodiment of the present disclosure. FIG. 12A provides the timing diagrams when the SID is “L” (or “0”) and a latency counter (which will be described in detail later in this disclosure) is in the I/F die 82. FIG. 12B provides the timing diagrams when the SID is “H” (or “1”) and the latency counter is in the I/F die 82.

The C[0] signal may include an SID bit at a first falling edge as shown in FIG. 6. The sampler 112 a may include flip-flop circuits (FF) 1121 and 1122. The FF 1121 receives the C[0] signal at a data input and the CK_t signal at a clock input. The FF 1121 captures the C[0] signal in response to a rising edge of the CK_t signal and provides the C0(Rise) signal according to the C[0] signal until a next rising edge of the CK_t signal. The FF 1122 receives the C[0] signal at a data input and an inverted signal of the CK_t signal at a clock input. The FF 1122 captures the C[0] signal in response to a falling edge of the CK_t signal and provides the C0(Fall) signal according to the C[0] signal until a next falling edge of the CK_t signal. For example, FIG. 12A shows that the C[0] signal includes column command (CC1) information at a rising edge of a first clock cycle.

The C[2] signal may include a parity bit PAR at a first falling edge as shown in FIG. 6. The sampler 112 b may include flip-flop circuits (FF) 1123 and 1124. The FF 1123 receives the C[2] signal at a data input and the CK_t signal at a clock input. The FF 1123 captures the C[2] signal in response to a rising edge of the CK_t signal and provides the C2(Rise) signal according to the C[2] signal until a next rising edge of the CK_t signal. The FF 1124 receives the C[2] signal at a data input and an inverted signal of the CK_t signal at a clock input. The FF 1124 captures the C[2] signal in response to a falling edge of the CK_t signal and provides the C2(Fall) signal according to the C[2] signal until a next falling edge of the CK_t signal.
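As a rough illustration of the rising-edge and falling-edge capture described for the samplers 112 a and 112 b, the following Python sketch splits the values present on one command/address pin at successive clock edges into a rising-edge stream and a falling-edge stream. The function name sample_ddr and the list-based signal model are assumptions made for illustration only; they do not appear in the disclosure, and setup, hold and propagation timing are not modeled.

```python
def sample_ddr(values_at_edges):
    """Split pin values seen at successive clock edges into two streams.

    values_at_edges lists the pin value at each edge of CK_t, starting
    with a rising edge: [rise0, fall0, rise1, fall1, ...].
    """
    rise = values_at_edges[0::2]  # roughly what FF 1121 or FF 1123 captures
    fall = values_at_edges[1::2]  # roughly what FF 1122 or FF 1124 captures
    return rise, fall


# Hypothetical C[0] contents over two clock cycles: a command bit "R",
# the SID on the first falling edge, then two further placeholder bits.
c0_rise, c0_fall = sample_ddr(["R", "SID", "b2", "b3"])
```

In this model the SID ends up as the first element of the falling-edge stream, which matches the position of the SID bit on C[0] described above.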

The command decoder circuit 113 may receive the C0(Rise), C0(Fall), C2(Rise) and C2(Fall) signals and the CK_t signal and may further decode the C0(Rise), C0(Fall), C2(Rise) and C2(Fall) signals (and other column command/address signals, which are not shown in FIG. 11) responsive to the CK_t signal to obtain a column command. For example, the command decoder circuit 113 may detect whether an access request represented by the column command is addressed to a stack group with SID “0” (e.g., the stack group 84 a) or a stack group with SID “1” (e.g., the stack group 84 b), responsive to the SID provided on the C0(Fall) signal. The command decoder circuit 113 may activate a clock enable signal cken0 (e.g., set to a logic high level) and may keep a clock enable signal cken1 deactivated (e.g., set to a logic low level) for a longer period than a transmission period of the command (e.g., two clock cycles), responsive to the SID “0” as shown in FIG. 12A. The command decoder circuit 113 may activate the cken1 signal and may keep the cken0 signal deactivated for the longer period than the transmission period of the command, responsive to the SID “1” as shown in FIG. 12B. If the column command is indicative of the function CNOP and the C0(Rise), C0(Fall), C2(Rise) and C2(Fall) signals include CNOP information, such as CNOP1R and CNOP1F information, responsive to the C[0] and C[2] signals including CNOP1R and CNOP1F information at rising and falling edges respectively, then whichever of the cken0 signal and the cken1 signal is active may be deactivated as shown in FIGS. 12A and 12B. The command decoder circuit 113 may include a parity logic circuit 1131. The parity logic circuit 1131 receives the parity bit PAR included in the C2(Fall) signal, and executes a parity check with regard to the column command/address signals. As the parity check may be executed on the command decoder circuit 113 (e.g., the IF command circuit 823 on the I/F die 82 of the HBM 80), and is not used in the core dies (e.g., the core dies 83 a to 83 h), the parity bit PAR may not be provided to the core dies.
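The selection of the clock enable signals described above can be summarized, under simplifying assumptions, by the following Python sketch. The function name decode_clock_enables and its boolean is_cnop argument are hypothetical; the sketch returns only the steady logic levels and does not model how long each enable is held (e.g., longer than the two-clock-cycle transmission period) or the parity check.

```python
def decode_clock_enables(sid, is_cnop):
    """Return (cken0, cken1) logic levels for one decoded column command.

    sid is the stack ID taken from C0(Fall); is_cnop marks a column
    no-operation command. Both are illustrative inputs only.
    """
    if is_cnop:
        # For CNOP, whichever enable was active is deactivated.
        return (0, 0)
    if sid == 0:
        return (1, 0)  # stack group with SID "0": cken0 active
    return (0, 1)      # stack group with SID "1": cken1 active


# Levels for an access to each stack group and for a CNOP command.
enables_sid0 = decode_clock_enables(0, False)
enables_sid1 = decode_clock_enables(1, False)
enables_cnop = decode_clock_enables(1, True)
```

At most one of the two enables is active for any command in this model, which is what allows a single enable per stack group to steer the command downstream.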

The gating sub circuit 114 a may include flip-flop circuits (FF) 1141 and 1143, a latch circuit 1142, a command output circuit 1144, a clock gate circuit CKG 1148 and a delay circuit 1149. The clock gate circuit CKG 1148 receives a clock enable signal cken at a data input and an inverted signal of the CK_t signal at a clock input. The cken signal is an output signal of an OR circuit 1147 which becomes active responsive to one of the cken0 signal and the cken1 signal being active. Similarly to the CKG 941, the CKG 1148 provides an inverted signal of the CK_t signal as a ck00 signal while the cken signal is active (e.g., a logic high level), and the level of the ck00 signal is maintained inactive (e.g., a logic low level) while the cken signal is inactive. The delay circuit 1149 delays the ck00 signal by a processing delay of the command output circuit 1144.

The C0(Fall) signal may be provided to an inverter 1145 and the inverted C0(Fall) signal may be provided to a NOR circuit 1146. The NOR circuit 1146 provides an output signal responsive to the inverted C0(Fall) signal and the cken0 signal. A latch circuit 1142 receives the output signal of the NOR circuit 1146 at a data input and the ck00 signal at a clock input. The latch circuit 1142 provides the output signal of the NOR circuit 1146 as an output signal to the command output circuit 1144 responsive to the ck00 signal being active (e.g., at a logic low level) and further maintains a logic level of the output signal to the command output circuit 1144 while the ck00 signal is inactive (e.g., at a logic high level). The C0(Rise) signal may be provided to a data input of the FF 1141 and the ck00 signal may be provided to a clock input of the FF 1141. The FF 1141 captures the C0(Rise) signal in response to a rising edge of the ck00 signal and provides a C0(Rtmp) signal according to the C0(Rise) signal to the FF 1143 until a next rising edge of the ck00 signal. Thus, the FF 1141 may provide a delay of more than a half clock cycle to the C0(Rtmp) signal relative to the C0(Rise) signal. The FF 1143 receives the C0(Rtmp) signal at a data input and an inverted signal of the ck00 signal at a clock input. The FF 1143 captures the C0(Rtmp) signal in response to a falling edge of the ck00 signal and provides an output signal to the command output circuit 1144 until a next falling edge of the ck00 signal. The command output circuit 1144 receives the output signal of the latch circuit 1142 and the output signal of the FF 1143, and provides a C_0[0] signal. Here, the command output circuit 1144 provides the C_0[0] signal including command information of the output signal of the latch circuit 1142 originated from the C0(Fall) signal and command information of the output signal of the FF 1143 originated from the C0(Rise) signal in this order, due to the delay of the FF 1141 on C0(Rtmp). Thus, the command information on the C0(Fall) signal is provided first, and the command information on the C0(Rise) signal is provided after the command information on the C0(Fall) signal. In this manner, the gating sub circuit 114 a may change (e.g., swap) a transmission order of a plurality of pieces of command information across two clock cycles (e.g., at a rising edge and at a falling edge) in the C_0[0] signal relative to the C[0] signal shown in FIGS. 12A and 12B to transmit the SID at an earliest possible timing (e.g., at the first clock cycle of the two clock cycles).
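The reordering performed by the gating sub circuit 114 a can be pictured with the following Python sketch, which is a purely combinational approximation and not the disclosed circuit. It assumes that the value held by the latch circuit 1142 is the NOR of the inverted C0(Fall) signal and the cken0 signal, as described above, and that this value is transmitted before the delayed C0(Rise) value. The function names are hypothetical, and clocking, latching and delay matching are not modeled.

```python
def nor(a, b):
    """Two-input NOR on 0/1 logic levels."""
    return int(not (a or b))


def c0_first_slot(c0_fall, cken0):
    """Value passed by the latch: NOR(NOT C0(Fall), cken0)."""
    return nor(int(not c0_fall), cken0)


def reorder_c0(c0_rise, c0_fall, cken0):
    """Return the two values sent on C_0[0], in transmission order.

    The fall-derived value (carrying the SID) goes first; the rise value
    follows because it is delayed through two flip-flop stages.
    """
    return [c0_first_slot(c0_fall, cken0), c0_rise]


# SID "1": cken0 stays low, so the SID bit is forwarded in the first slot.
sent_sid1 = reorder_c0(c0_rise=0, c0_fall=1, cken0=0)
# SID "0": cken0 is high and the first slot carries a logic low level.
sent_sid0 = reorder_c0(c0_rise=1, c0_fall=0, cken0=1)
```

In both cases the first transmitted value equals the SID, so a receiving die can resolve the destination on the first rising edge it sees.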

The gating sub circuit 114 b may include flip-flop circuits (FF) 1151 and 1153, a latch circuit 1152 and a command output circuit 1154. The latch circuit 1152 receives the cken1 signal at a data input and the ck00 signal at a clock input. The latch circuit 1152 provides the cken1 signal as an output signal to the command output circuit 1154 responsive to the active ck00 signal (e.g., at the logic low level) and further maintains a logic level of the output signal to the command output circuit 1154 while the ck00 signal is inactive (e.g., at the logic high level). The C2(Rise) signal may be provided to a data input of the FF 1151 and the ck00 signal may be provided to a clock input of the FF 1151. The FF 1151 captures the C2(Rise) signal in response to a rising edge of the ck00 signal and provides a C2(Rtmp) signal according to the C2(Rise) signal to the FF 1153 until a next rising edge of the ck00 signal. Thus, the FF 1151 may provide a delay of more than a half clock cycle to the C2(Rtmp) signal relative to the C2(Rise) signal. The FF 1153 receives the C2(Rtmp) signal at a data input and an inverted signal of the ck00 signal at a clock input. The FF 1153 captures the C2(Rtmp) signal in response to a falling edge of the ck00 signal and provides an output signal to the command output circuit 1154 until a next falling edge of the ck00 signal. The command output circuit 1154 receives the output signal of the latch circuit 1152 and the output signal of the FF 1153, and provides a C_0[2] signal. Here, the command output circuit 1154 provides the C_0[2] signal including command information of the output signal of the latch circuit 1152 originated from the C2(Fall) signal and command information of the output signal of the FF 1153 originated from the C2(Rise) signal in this order, due to the delay of the FF 1151 on C2(Rtmp). Thus, the command information on the C2(Fall) signal is provided first, and the command information on the C2(Rise) signal is provided after the command information on the C2(Fall) signal. In this manner, the gating sub circuit 114 b may change (e.g., swap) a transmission order of a plurality of pieces of command information across two clock cycles (e.g., at a rising edge and at a falling edge) in the C_0[2] signal relative to the C[2] signal shown in FIGS. 12A and 12B to transmit the cken1 information at an earliest possible timing (e.g., at the first clock cycle of the two clock cycles). Because the PAR bit may not be used in the core dies, command information on the cken1 signal corresponding to the SID may be reflected on C_0[2] at a rising edge in place of the PAR bit.

The command control circuit 110 may also include combinations of a sampler and a gating sub circuit for each of C[7:3, 1] signals (not shown). The combination of the sampler and the gating sub circuit may be similar to the sampler 112 b and the gating sub circuit 114 b; however, the gating sub circuits for the C[7:3, 1] signals may not include an FF equivalent to the FF 1151. Unlike the C[2] and C[0] signals, the C[7:3, 1] signals may be provided without swapping an order of the information.

FIG. 13 is a block diagram of a portion of a core die in the HBM in a semiconductor device in accordance with an embodiment of the present disclosure. Description of components and signals corresponding to components and signals included in FIG. 8 will not be repeated. For example, a core die Core1 133 a which has an SID “0” may be used as the Core1 83 a in FIG. 8. The core die Core1 133 a may include a plurality of input buffers Rx 1331 which may receive the clock signal CK_t_0, a plurality of row command/address signals R_0[5:0] and a plurality of column command/address signals C_0[7:0], respectively. The received signals CK_t_0, R_0[5:0] and C_0[7:0] may be provided from the plurality of input buffers Rx 1331 to an output buffer (Tx) control circuit 1334. The Tx control circuit 1334 detects the SID included in the C_0[0] signal at a rising edge of the CK_t_0 signal as shown in FIGS. 12A and 12B, and provides a clock signal CK_t_1, a plurality of row command/address signals R_1[5:0] and a plurality of column command/address signals C_1[7:0] through a plurality of output buffers Tx 1335 which drive the conductive path 87 b (FIG. 8), responsive to the SID information being different from the SID “0” (e.g., the SID “1”). If the SID information is indicative of the SID “0” for the core die Core1 133 a, the Tx control circuit 1334 may stop providing the CK_t_1, R_1[5:0] and C_1[7:0] signals by refraining from driving the conductive path 87 b.

FIG. 14 is a block diagram of the Tx control circuit 1334 on the core die 133 a in FIG. 13 in accordance with an embodiment of the present disclosure. The core die Core1 133 a may include a plurality of input buffers Rx 1331 which may receive the clock signal CK_t_0 and a plurality of column command/address signals C_0[7:0], respectively. For example, the plurality of input buffers Rx 1331 may provide the received signals CK_t_0 and C_0[7:0] as CK_Rx and C_Rx[7:0] signals to the Tx control circuit 1334, respectively.

For example, the Tx control circuit 1334 may include an FF 1403. As mentioned earlier, the C_0[0] signal may include the SID at a rising edge of the first clock cycle as shown in FIGS. 12A and 12B. Thus, the C_Rx[0] signal may include the SID information at a rising edge of a first clock cycle of the CK_Rx signal. The FF 1403 may receive the C_Rx[0] signal at a data input and the CK_Rx signal at a clock input. The FF 1403 captures the C_Rx[0] signal in response to a rising edge of the CK_Rx signal, and an inverter 1404 receives an output signal from the FF 1403 and provides an SID_1 signal which is an inverted signal of the SID. A plurality of OR circuits 1405 may receive the C_Rx[7:0] signals and provide the C_Tx[7:0] signals responsive to the SID_1 signal. Thus, the C_Tx[7:0] signals may be the C_Rx[7:0] signals when the SID_1 signal is at a logic low level indicating that the column command is provided to an upper core die. The plurality of output buffers Tx 1335 may receive the C_Tx[7:0] signals and may further drive the conductive path 87 b (e.g., 37 a in FIG. 4B) in order to provide the C_1[7:0] signals to upper core dies. The C_Tx[7:0] signals may be set to a logic high level in order to refrain from driving the conductive path 87 b, when the SID_1 signal is at a logic high level indicating that the column command is provided to the core die Core1 133 a.
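The masking performed by the inverter 1404 and the OR circuits 1405 can be expressed, as a minimal sketch, in Python. The function name tx_column_signals and the representation of C_Rx[7:0] as a list of 0/1 values are assumptions for illustration; the capture of the SID by the FF 1403 at a clock edge is reduced here to passing the captured bit as an argument.

```python
def tx_column_signals(c_rx, sid_bit):
    """Return C_Tx[7:0] given C_Rx[7:0] and the captured SID bit.

    c_rx is a list of eight 0/1 values. sid_bit is the value captured
    from C_Rx[0] at the first rising edge of CK_Rx.
    """
    sid_1 = 1 - sid_bit                    # inverter 1404
    return [bit | sid_1 for bit in c_rx]   # one OR circuit 1405 per signal


pattern = [1, 0, 1, 1, 0, 0, 1, 0]  # arbitrary example levels
forwarded = tx_column_signals(pattern, sid_bit=1)  # command for an upper die
blocked = tx_column_signals(pattern, sid_bit=0)    # command for this die
```

When the SID designates an upper core die the pattern passes through unchanged, and when it designates the current die every output is forced to the logic high level.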

For example, the Tx control circuit 1334 may include a clock gate circuit CKG 1401. The clock gate circuit CKG 1401 may receive the CK_Rx signal at a clock input and the C_Rx[2] signal at a data input. As mentioned earlier, the C_0[2] signal may include the cken1 signal at a rising edge of the first clock cycle as shown in FIGS. 12A and 12B. Thus, the C_Rx[2] signal may include the cken1 information indicative of whether a designated stack group is at a current die (e.g., at a logic low level) or at an upper core die (e.g., at a logic high level), at the rising edge of the first clock cycle. The CKG 1401 provides the C_Rx[2] signal as an intermediate result signal (not shown) while the CK_Rx signal is not active (e.g., a logic low level), and a level of the intermediate result signal is maintained while the CK_Rx signal is active (e.g., a logic high level). The CKG 1401 provides an intermediate clock signal CK_Int which is a result of an AND operation of the intermediate result signal and the CK_Rx signal. Thus, the CKG 1401 is opened to allow the CK_Int signal to be conveyed to the upper die when the stack group indicated belongs to the upper die. For example, the CKG 1401 is opened responsive to the C_Rx[2] signal having the logic high level. On the other hand, the CKG 1401 is closed to block the CK_Int signal from being conveyed to the upper die when the stack group is associated with the current die. For example, the CKG 1401 is closed responsive to the C_Rx[2] signal having the logic low level. A delay circuit 1402 receives the CK_Int signal and delays the CK_Int signal in order to provide an output clock signal CK_Tx. The output clock signal CK_Tx has a delay relative to the CK_Rx signal where the delay corresponds to a delay of the C_Tx[7:0] signals relative to the C_Rx[7:0] signals. The delay circuit 1402 provides a clock signal CK_Tx having the delay relative to the CK_Rx signal. One buffer of the plurality of output buffers Tx 1335 may receive the CK_Tx signal and provide the CK_t_1 signal to upper core dies.
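The behavior described for the clock gate circuit CKG 1401 resembles a latch followed by an AND gate. The following Python sketch is a behavioral approximation under that assumption: the class name LatchClockGate and the half-cycle stepping model are hypothetical, and the delay circuit 1402 is not modeled.

```python
class LatchClockGate:
    """Behavioral model of a latch-plus-AND clock gate (assumed structure)."""

    def __init__(self):
        self.latched = 0  # stands in for the intermediate result signal

    def step(self, ck_rx, c_rx2):
        """Advance one half cycle and return the CK_Int level."""
        # The latch follows C_Rx[2] only while CK_Rx is at the low level
        # and holds its value while CK_Rx is at the high level.
        if ck_rx == 0:
            self.latched = c_rx2
        # CK_Int is the AND of the held value and CK_Rx.
        return self.latched & ck_rx


gate = LatchClockGate()
# (CK_Rx, C_Rx[2]) pairs over successive half cycles; the last pair changes
# the enable while the clock is high, which must not produce a pulse.
trace = [(0, 1), (1, 1), (0, 0), (1, 0), (1, 1)]
ck_int = [gate.step(ck, en) for ck, en in trace]
```

Holding the enable while the clock is high is what keeps a late change on C_Rx[2] from creating a partial clock pulse toward the upper die.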

Thus, the Tx control circuit 1334 may be opened to allow the clock signal CK_t_1 and the C_1[7:0] signals to be conveyed to the upper core dies when the column command is related to a stack group of the upper core dies, whereas the Tx control circuit 1334 may be closed to block the clock signal CK_t_1 and the C_1[7:0] signals from being conveyed to upper core dies by setting these signals to predetermined levels (e.g., the logic low level for the CK_t_1 signal, and the logic high level for the C_1[7:0] signals) when the column command is related to a stack group of lower core dies (e.g., the core die Core1 133 a).

The HBM 1 may support a “data read/write latency” function to adjust a delay of data to be read/written relative to a read/write command. For example, data write latency may be defined from a rising edge of a clock signal on which the write command is issued to a rising edge of the clock signal from which a first byte of the data to be written is provided. FIG. 15 is a timing diagram of clock signals, command signals and data signals to be provided to a portion of dies in a write operation, in accordance with an embodiment of the present disclosure. For example, write latency WL may be three clock cycles (WL=3) in FIG. 15. FIG. 16 is a block diagram of an HBM 160 in a semiconductor device in accordance with an embodiment of the present disclosure. For example, the HBM 160 may be the HBM 1. An I/F die 162 receives column command/address signals C[7:0] conveying a column command (e.g., a write command) at time T0 and provides C_0[7:0] signals to a core die Core1 163 a. The I/F die 162 may also receive data signals DQ at time T3 with the WL. For example, the I/F die 162 may include a command decoder 164 a (e.g., in the IF command circuit 823). Responsive to the write command, the command decoder 164 a may provide a control signal wrtcom with the WL from the write command. Responsive to the wrtcom signal, a clock signal rwclk_0 and data signals DQ_0 may be provided to the core die Core1 163 a. The clock signal rwclk_0 may be used while receiving the DQ_0 signals. For example, the core die Core1 163 a may include a command decoder 164 b (e.g., in the command circuit 833 a). Responsive to the C_0[7:0] signals, the command decoder 164 b may provide a control signal wrtcomc. Another write command may follow immediately after the write command at T2, before time T3. The command decoder 164 b may capture an SID from the C_0[7:0] signals soon after time T0, prior to receiving the DQ_0 signals at around time T3.

The core die Core1 163 a may include a write SID counter 165 b. FIG. 17 is a block diagram of a write SID counter 170 in a core die, in accordance with an embodiment of the present disclosure. The write SID counter 170, which may function like a first-in-first-out (FIFO) memory, may be used as the write SID counter 165 b, for example. The write SID counter 170 may include an input pointer 171 and an output pointer 172. The input pointer 171 receives the control signal wrtcomc from a command decoder (e.g., the command decoder 164 b). The output pointer 172 receives the rwclk_0 signal from the I/F die (e.g., the I/F die 162). As shown in FIG. 15, the wrtcomc signal may be activated for each write command at times T0 and T2; thus the input pointer 171 may increase a count to 1 responsive to the write command at time T0, and may further increase the count to 2 responsive to the write command at time T2. As shown in FIG. 15, the rwclk_0 signal may be activated with the WL at times T3 and T4. The output pointer 172 may increase a count from 1 to 2 at time T3 and may further increase the count to 3 at time T4 responsive to the rwclk_0 signal. The write SID counter 170 may include flip-flops 173 and 174. The flip-flops 173 receive an SID at a data input and an output signal of the input pointer 171 at a clock input. Responsive to the SID indicative of a stack group of an upper core die (e.g., a core die Core5 163 e), the flip-flops 173 may provide the SID at timings responsive to the output signal of the input pointer 171. The flip-flops 174 receive an output signal of the flip-flops 173 at a data input and an output signal of the output pointer 172 at a clock input. Responsive to the output signal of the flip-flops 173, the flip-flops 174 may provide the output signal of the flip-flops 173 at timings responsive to the output signal of the output pointer 172. FIG. 18 is a timing diagram of clock related signals and column command signals in the HBM in FIG. 8, in accordance with an embodiment of the present disclosure. Unlike in FIGS. 12A and 12B, C_0[2] and C_1[2] signals in FIG. 18 convey the cken1 at a logic high level at rising edges of third and fourth clock cycles, CK_t_0 and CK_t_1, indicative of using a write SID counter in the core dies. Thus, the SID may be stored for the WL until the DQ_0 is received while receiving consecutive commands, and transmissions of the C_1[7:0] signals, data signals DQ_1 and a clock signal rwclk_1 to the upper core die may be determined responsive to the timely stored SID.
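The FIFO-like role of the write SID counter 170 can be illustrated with the short Python sketch below. It is an assumption-laden stand-in rather than the disclosed circuit: a collections.deque replaces the input pointer 171, the output pointer 172 and the flip-flops 173 and 174, the class and method names are hypothetical, and the SID values in the example are chosen arbitrarily.

```python
from collections import deque


class WriteSidFifo:
    """Queue SIDs at write commands and release them when data arrives."""

    def __init__(self):
        self._pending = deque()

    def on_wrtcomc(self, sid):
        """Store the SID of a write command (a wrtcomc activation)."""
        self._pending.append(sid)

    def on_rwclk(self):
        """Return the oldest stored SID (an rwclk_0 activation)."""
        return self._pending.popleft()


fifo = WriteSidFifo()
fifo.on_wrtcomc(1)           # write command at time T0, hypothetical SID "1"
fifo.on_wrtcomc(0)           # write command at time T2, hypothetical SID "0"
sid_at_t3 = fifo.on_rwclk()  # data for the first write arrives at time T3
sid_at_t4 = fifo.on_rwclk()  # data for the second write arrives at time T4
```

Because the second command arrives before the data of the first, the SIDs have to be queued in order so that each burst of write data is steered by the SID of its own command.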

Logic levels of signals and logic gate combinations used in the embodiments described above are merely examples. In other embodiments, combinations of logic levels of signals and combinations of logic gates other than those specifically described in the present disclosure may be used without departing from the scope of the present disclosure.

Although this invention has been disclosed in the context of certain preferred embodiments and examples, it will be understood by those skilled in the art that the inventions extend beyond the specifically disclosed embodiments to other alternative embodiments and/or uses of the inventions and obvious modifications and equivalents thereof. In addition, other modifications which are within the scope of this invention will be readily apparent to those of skill in the art based on this disclosure. It is also contemplated that various combinations or sub-combinations of the specific features and aspects of the embodiments may be made and still fall within the scope of the inventions. It should be understood that various features and aspects of the disclosed embodiments can be combined with or substituted for one another in order to form varying modes of the disclosed invention. Thus, it is intended that the scope of at least some of the present invention herein disclosed should not be limited by the particular disclosed embodiments described above.

1. An apparatus comprising: at least one stack group corresponding, respectively, to at least one stack identifier; and a first die including a command circuit configured to receive command information including one stack identifier of the at least one stack identifier in relation to a falling edge of a first clock signal, the command circuit further configured to provide the command information including the one stack identifier in relation to a rising edge of a second clock signal.
2. The apparatus of claim 1, wherein, before receiving the command information, the first die is configured to receive other command information, and wherein the first die is further configured to provide the other command information in relation to a falling edge of the second clock signal.
3. The apparatus of claim 1, wherein the first die is configured to receive other command information in relation to a rising edge of the first clock signal.
4. The apparatus of claim 1, wherein, before receiving the command information, the first die is configured to receive other command information, and wherein, after providing the command information, the first die is configured to provide the other command information.
5. The apparatus of claim 1, wherein, before receiving the command information, the first die is configured to receive other command information, and wherein the first die further includes: an input buffer; an output buffer; a sampler circuit configured to receive the other command information from the input buffer and provide the other command information to the command circuit, the command circuit including a delay circuit configured to delay the other command information, the command circuit configured to provide the other command information from the delay circuit to the output buffer.
6. The apparatus of claim 1, wherein, before receiving the command information, the first die is configured to receive other command information, and wherein the other command information is received in relation to the rising edge of the first clock signal and is delayed to be provided in relation to the falling edge of the second clock signal.
7. The apparatus of claim 1, wherein, before receiving the command information, the first die is configured to receive other command information, and wherein the other command information is received by the first die in relation to a rising edge of the first clock signal, and the other command information is delayed to be provided after the command information received by the first die and in relation to a falling edge of the second clock signal.
8. The apparatus of claim 1, further comprising: a second die coupled to the first die; and a third die coupled to the second die, wherein, before receiving the command information, the first die is configured to receive other command information, wherein the second die is configured to receive the command information and the other command information from the first die when the second command information designates the third die as a die to be accessed, and wherein the third die is configured to receive the command information and the other command information from the second die when the second command information designates the third die as the die to be accessed.
9. The apparatus of claim 1, further comprising: a second die coupled to the first die; and a third die coupled to the second die, wherein, before receiving the command information, the first die is configured to receive other command information, wherein the second die is configured to receive the command information and the other command information from the first die when the second command information designates the second die as a die to be accessed, and wherein the third die is configured to not receive the command information and the other command information from the second die when the second command information designates the second die as the die to be accessed.
10. A method comprising: receiving, at a first die, first command information; receiving, at the first die, second command information after the first command information, the second command information including a stack identifier; and providing, from the first die and to at least one stack group, the first command information after the second command information.
11. The method of claim 10, wherein an order of the first command information and the second command information is reversed.
12. The method of claim 10, wherein the providing, from the first die and to the at least one stack group, the first command information after the second command information includes: reversing, by the first die, an order of the first command information and the second command information; and providing, to a second die of the at least one stack group, the first command information after the second command information.
13. The method of claim 10, further comprising: detecting, in the second command information, a destination of the second command information; and providing, to a second die coupled to the first die, the second command information if the destination is the second die.
14. The method of claim 10, further comprising: detecting, by a second die coupled to the first die, a destination of the second command information; and refraining from providing, to a third die coupled to the second die, the second command information if the destination of the second command information is the second die.
15. A method comprising: capturing, at an interface die, first command information responsive to a first edge of a first clock signal; capturing, at the interface die, second command information responsive to a second edge of the first clock signal, the second command information including a stack identifier, the second edge following the first edge so that the interface die captures the second command information after capturing the first command information; providing, from the interface die and to a plurality of core dies, the second command information responsive to a third edge of a second clock signal; and providing, from the interface die and to the plurality of core dies, the first command information responsive to a fourth edge of the second clock signal, the fourth edge following the third edge so that the interface die provides the second command information before providing the first command information.
16. The method of claim 15, wherein the first edge and the second edge of the first clock signal are a rising edge and a falling edge of the first clock signal, respectively, and the third edge and the fourth edge of the second clock signal are a rising edge and a falling edge of the second clock signal, respectively.
17. The method of claim 15, further comprising delaying the first clock signal to provide the second clock signal.
18. The method of claim 15, further comprising: capturing, at at least one of the plurality of core dies, the second command information responsive to the third edge of the second clock signal; and capturing, at at least one of the plurality of core dies, the first command information responsive to the fourth edge of the second clock signal.
19. The method of claim 15, wherein the first command information is captured at a first input buffer of the interface die, and is subsequently captured at a second input buffer of at least one of the plurality of core dies.
20. The method of claim 19, wherein the second command information is captured at the first input buffer of the interface die, and wherein the second command information is subsequently captured at the second input buffer of the at least one of the plurality of core dies before the first command information is captured at the second input buffer.